Lessons for the Future from a Decade of Informedia Video Analysis Research
Identifieur interne : 001283 ( Main/Exploration ); précédent : 001282; suivant : 001284Lessons for the Future from a Decade of Informedia Video Analysis Research
Auteurs : G. Hauptmann [États-Unis]Source :
- Lecture Notes in Computer Science [ 0302-9743 ] ; 2005.
Abstract
Abstract: The overarching goal of the Informedia Digital Video Library project has been to achieve machine understanding of video media, including all aspects of search, retrieval, visualization and summarization in both contemporaneous and archival content collections. The base technology developed by the Informedia project combines speech, image and natural language understanding to automatically transcribe, segment and index broadcast video for intelligent search and image retrieval. While speech processing has been the most influential component in the success of the Informedia project, other modalities can be critical in various situations. Evaluations done in the context of the TRECVID benchmarks show that while some progress has been made, there is still a lot of work ahead. The fundamental “semantic gap” still exists, but there are a number of promising approaches to bridging it.
Url:
DOI: 10.1007/11526346_1
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 001380
- to stream Istex, to step Curation: 001300
- to stream Istex, to step Checkpoint: 000B92
- to stream Main, to step Merge: 001319
- to stream Main, to step Curation: 001283
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Lessons for the Future from a Decade of Informedia Video Analysis Research</title>
<author><name sortKey="Hauptmann, G" sort="Hauptmann, G" uniqKey="Hauptmann G" first="G." last="Hauptmann">G. Hauptmann</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:8EAEE8462A4CFC7C03932537AF5AAD784FEE76B5</idno>
<date when="2005" year="2005">2005</date>
<idno type="doi">10.1007/11526346_1</idno>
<idno type="url">https://api.istex.fr/document/8EAEE8462A4CFC7C03932537AF5AAD784FEE76B5/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001380</idno>
<idno type="wicri:Area/Istex/Curation">001300</idno>
<idno type="wicri:Area/Istex/Checkpoint">000B92</idno>
<idno type="wicri:doubleKey">0302-9743:2005:Hauptmann G:lessons:for:the</idno>
<idno type="wicri:Area/Main/Merge">001319</idno>
<idno type="wicri:Area/Main/Curation">001283</idno>
<idno type="wicri:Area/Main/Exploration">001283</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Lessons for the Future from a Decade of Informedia Video Analysis Research</title>
<author><name sortKey="Hauptmann, G" sort="Hauptmann, G" uniqKey="Hauptmann G" first="G." last="Hauptmann">G. Hauptmann</name>
<affiliation wicri:level="4"><country xml:lang="fr">États-Unis</country>
<wicri:regionArea>School of Computer Science, Carnegie Mellon University, 15213, Pittsburgh, PA</wicri:regionArea>
<placeName><region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">États-Unis</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2005</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">8EAEE8462A4CFC7C03932537AF5AAD784FEE76B5</idno>
<idno type="DOI">10.1007/11526346_1</idno>
<idno type="ChapterID">1</idno>
<idno type="ChapterID">Chap1</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: The overarching goal of the Informedia Digital Video Library project has been to achieve machine understanding of video media, including all aspects of search, retrieval, visualization and summarization in both contemporaneous and archival content collections. The base technology developed by the Informedia project combines speech, image and natural language understanding to automatically transcribe, segment and index broadcast video for intelligent search and image retrieval. While speech processing has been the most influential component in the success of the Informedia project, other modalities can be critical in various situations. Evaluations done in the context of the TRECVID benchmarks show that while some progress has been made, there is still a lot of work ahead. The fundamental “semantic gap” still exists, but there are a number of promising approaches to bridging it.</div>
</front>
</TEI>
<affiliations><list><country><li>États-Unis</li>
</country>
<region><li>Pennsylvanie</li>
</region>
<settlement><li>Pittsburgh</li>
</settlement>
<orgName><li>Université Carnegie-Mellon</li>
</orgName>
</list>
<tree><country name="États-Unis"><region name="Pennsylvanie"><name sortKey="Hauptmann, G" sort="Hauptmann, G" uniqKey="Hauptmann G" first="G." last="Hauptmann">G. Hauptmann</name>
</region>
<name sortKey="Hauptmann, G" sort="Hauptmann, G" uniqKey="Hauptmann G" first="G." last="Hauptmann">G. Hauptmann</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001283 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001283 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:8EAEE8462A4CFC7C03932537AF5AAD784FEE76B5 |texte= Lessons for the Future from a Decade of Informedia Video Analysis Research }}
This area was generated with Dilib version V0.6.32. |